We consider a particular Cox process from a Bayesian viewpoint
and show that the Bayes estimator of the intensity measure is the 
so-called Polya sum kernel, which occurred recently in the context 
of the construction of the so-called Papangelou processes. More 
precisely, if the prior, the directing measure of the Cox process, is 
a Poisson-Gamma random measure, then the posterior is again a 
Poisson-Gamma random measure and the Bayes estimator of the 
intensity is the Polya sum kernel. Moreover, we extend this result to 
doubly stochastic Poisson-Gamma priors and give conditions under 
which one can identify the Bayes estimator for the intensity. 
Keywords: Point process, Campbell measure, Cox process, Polya
sum process, Papangelou process, Bayes estimator

1 Introduction 

Given some statistical model of point processes, the interest lies in deriving
statements about unknown parameters from observations, which are point
configurations. For Poisson processes on a Euclidean space it is possible to
determine the intensity measure among the stationary ones from a single
observation [11]. In a Bayesian context, one starts with a probability
distribution on the set of parameters, which may be interpreted as prior
information or a degree of belief on the set of models, and is interested firstly
in the law of the parameter given some observations, the posterior law, and
secondly in the estimator for the parameter.

Staying in the context of Poisson processes, any choice of a prior distribution
on the set of stationary intensity measures leads to degenerate posteriors in the
sense that they are concentrated on a single intensity measure. For a larger
parameter set, when such perfect estimates are not available, one is interested
in finding suitable sets of priors, as discussed e.g. in [1]. Desirable properties
are analytical tractability, in the sense that it should be possible to determine
the posterior law given some observation analytically, and that the posterior
should belong to the same class of distributions as the prior. In this
case the set of priors is said to be closed under sampling or conjugate.

The questions considered in [11] are strongly connected to Bayesian statistics;
they were discussed in an abstract form in [4, 3]. In terms of point processes one
starts with a consistent family of local specifications, i.e. local laws, and aims
at constructing firstly all stochastic fields which are specified by this family, and
secondly integral representations of these point processes as mixtures of certain
extremal elements. Once the integral representation is obtained and the mixing
measure is interpreted as a prior, a single observation is sufficient to determine
the particular extremal element. This property was called ergodic decomposability
in [5] and is equivalent to the posterior being a Dirac measure for almost every
observation.

Returning to the conjugate classes of priors, one important example is the
class of Gamma distributions as priors for the family of Poisson distributions:
If the prior is $\Gamma(a, r)$ distributed, then the posterior given the observation $k$ is
$\Gamma(a + 1, r + k)$ distributed. Hence the family of Gamma distributions is closed
under sampling for Poisson models. Of course, this does not touch the question
whether the family of Gamma distributions is a natural choice.
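
The one-dimensional update can be checked numerically. The following Python sketch assumes that $\Gamma(a, r)$ denotes the Gamma distribution with rate $a$ and shape $r$, the reading under which the update $a \mapsto a + 1$, $r \mapsto r + k$ is the usual conjugate one. All function names and numerical values are illustrative and not taken from the text. It compares the closed-form posterior mean $(r + k)/(a + 1)$ with a direct numerical integration of prior times likelihood.

```python
import math

def posterior_params(a, r, k):
    """Conjugate update: prior Gamma(rate a, shape r), one Poisson count k."""
    return a + 1.0, r + k

def numeric_posterior_mean(a, r, k, upper=60.0, steps=100_000):
    """Posterior mean by midpoint-rule integration of prior density times likelihood."""
    # Constants of the Gamma density and the Poisson weight (they cancel in the ratio).
    log_const = r * math.log(a) - math.lgamma(r) - math.lgamma(k + 1)
    h = upper / steps
    num = den = 0.0
    for i in range(steps):
        lam = (i + 0.5) * h
        # prior density lam^(r-1) e^(-a lam) times Poisson weight lam^k e^(-lam)
        weight = math.exp(log_const + (r + k - 1.0) * math.log(lam) - (a + 1.0) * lam)
        num += lam * weight
        den += weight
    return num / den

# Illustrative values (assumptions, not from the paper).
a, r, k = 1.5, 2.0, 4
a_post, r_post = posterior_params(a, r, k)
closed_form_mean = r_post / a_post          # mean of Gamma(rate a+1, shape r+k)
numeric_mean = numeric_posterior_mean(a, r, k)
print(a_post, r_post, closed_form_mean, numeric_mean)
```

The two means agree up to the quadrature error, which is all the sketch is meant to illustrate.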

We generalize this result to general random measures and point processes:
Starting with a Poisson process and its intensity measure being a Gamma-Poisson
random measure, the posterior is again a Gamma-Poisson random
measure. More precisely, if the prior is the Gamma-Poisson random measure
with parameters $a \in \mathbb{R}_+$ and $\rho$ a $\sigma$-finite measure, then the posterior given
the observation $\mu$, a point configuration with possibly multiple points, has
parameters $a + 1$ and $\rho + \mu$. In particular the class of Gamma-Poisson random
measures is such a big class that there is no chance to get more precise
information about the directing intensity measure from a single observation. Moreover
we show that the Bayes estimator for the underlying intensity is $z(\rho + \mu)$, where
$z$ depends on $a$ in a unique way.

A similar generalization, from Dirichlet distributions for multinomial models,
is given by Dirichlet processes for point processes realizing a fixed number of
points [15].

In fact, the subsequently presented ideas reverse the original considerations
of Zessin [16]: Is there a point process which places its points according to a
Polya urn mechanism instead of a simple urn mechanism like the Poisson process?
Zessin constructed this so-called Polya sum process as the unique point
process with Papangelou kernel $z(\rho + \mu)$, that is, the Polya sum process
is the unique solution of a particular integration-by-parts formula. Intuitively,
the Papangelou kernel can be understood as a conditional intensity at which
points are placed given the observation $\mu$. In [13] this Polya sum process was
identified as a Cox process, the underlying random intensity being a Gamma-Poisson
random measure. This connection is the subject of section 2.

In a further step we allow the parameters to be random and then consider
Cox processes directed by doubly stochastic Gamma-Poisson random measures.
Equivalently, we consider doubly stochastic Polya sum processes. We put the
results in [14] into the Bayesian context: Point processes with local laws given
by conditioned Polya sum processes were identified as certain mixed Polya
sum processes, hence again one has an integral representation applicable for
Bayesian analysis. Firstly, for any prior which is concentrated on a certain
parameter set, the posterior will be concentrated on a single point and therefore
determines the parameters uniquely, hence Bayes kernel and estimator of the
intensity can be identified explicitly. Secondly, using the Cox representation
of the Polya sum process, we obtain the results of the first part with a doubly
stochastic Poisson-Gamma process as directing measure, a result in the spirit
of [1] for mixed Dirichlet processes.

Implications of these results are that for particular cases the doubly
stochastic Polya sum process is again a Papangelou process, and moreover
that each point process satisfying the same integration-by-parts formula
must be a doubly stochastic Polya sum process. These ideas are presented in
section 3, the proofs are contained in section 4.

2 Some random measures, point processes and results 

Let $X$ be a Polish space and denote by $\mathcal{B} = \mathcal{B}(X)$ its Borel sets as well as by
$\mathcal{B}_0 = \mathcal{B}_0(X)$ the ring of bounded Borel sets of $X$. Furthermore let $\mathcal{M}(X)$ and
$\mathcal{M}^{\cdot\cdot}(X)$ be the space of locally finite measures and locally finite point measures
on $X$, respectively, each of which is vaguely Polish, with the $\sigma$-algebras generated
by the evaluation mappings $\zeta_B(\mu) = \mu(B)$, $B \in \mathcal{B}_0$. $\mathcal{M}^{\cdot\cdot}(X)$ is the set of
observable point configurations, i.e. locally finite subsets of $X$ with possibly
multiple points, and $\mathcal{M}(X)$ is the set of 'mass distributions' with finite mass
in each bounded set. $\zeta_B(\mu)$ counts the number of points of $\mu \in \mathcal{M}^{\cdot\cdot}(X)$ inside
$B$ considering possible multiplicities.

A probability measure $P$ on $\mathcal{M}(X)$ is a random measure, and if $P$ is
concentrated on $\mathcal{M}^{\cdot\cdot}(X)$, a point process. Finally let $F(X)$ be the set of
bounded, non-negative and measurable functions on $X$ and $F_b(X) \subset F(X)$
the subset of those functions in $F(X)$ with bounded support. Following the
notation $\zeta_B(\mu) = \mu(B)$, denote by $\zeta_f(\mu) = \mu(f) = \int f \, d\mu$ for $f \in F(X)$ and a
measure $\mu \in \mathcal{M}(X)$ the evaluation mapping of $f$ at $\mu$.

Apart from the finite dimensional distributions, to characterize a random
measure or point process $P$ uniquely, one may either use the Laplace transform

$$L_P(f) = \int e^{-\zeta_f} \, dP, \qquad f \in F(X),$$

or the Campbell measure

$$C_P(h) = \iint h(x, \mu) \, \mu(dx) \, P(d\mu), \qquad h \in F(X \times \mathcal{M}^{\cdot\cdot}(X)).$$

The Campbell measure admits under suitable assumptions two disintegrations,
one with respect to its intensity measure yielding Palm kernels, and one with
respect to $P$ itself yielding the Papangelou kernels. The famous example is the
Poisson process $P_\rho$ with intensity measure $\rho \in \mathcal{M}(X)$, for which

$$C_{P_\rho}(h) = \iint h(x, \mu + \delta_x) \, \rho(dx) \, P_\rho(d\mu), \qquad h \in F(X \times \mathcal{M}^{\cdot\cdot}(X)). \tag{2.1}$$

Moreover, there is exactly one point process which satisfies equation (2.1),
known as Mecke's characterization of the Poisson process. The kernel $\eta(\mu, dx) =
\rho(dx)$ does not depend on the configuration $\mu$, meaning that the point process
places points independently of all other points with the same distribution.

In analogy to inductive rules defining a law given some observation, one
should read equation (2.1) as follows: given any observed point configuration $\mu$, the
intensity for another point is given by the measure $\rho$; and moreover, if some
point process is given by this rule, it must be the Poisson process. In a similar
way the following modification has to be understood, with additional rewards
to observed points. Note that not every choice of a rule specifies a point
process, nor, in case of existence, need this point process be unique.

Recently, Zessin considered in [16] the Papangelou kernel

$$\eta(\mu, dx) = z(\rho + \mu)(dx)$$

for a measure $\rho \in \mathcal{M}(X)$ and some $z \in (0, 1)$, which replaces the urn
mechanism with replacement of the Poisson process by a Polya urn mechanism.
Instead of a point being placed according to the intensity $\rho$ independently of a
present configuration, here points in the configuration $\mu$ get a reward for the
intensity of the following point. Zessin answered the question of the existence
and uniqueness of a point process satisfying the functional equation

$$C_P(h) = z \iint h(x, \mu + \delta_x) \, (\rho + \mu)(dx) \, P(d\mu), \qquad h \in F(X \times \mathcal{M}^{\cdot\cdot}(X)). \tag{2.2}$$

Again, there is exactly one solution, the Polya sum process $S_{z,\rho}$. Like the
Poisson process, this point process has independent increments and is infinitely
divisible. In contrast to the Poisson process, $S_{z,\rho}$ is not a simple point process
even if $\rho$ is a diffuse measure. Moreover, $S_{z,\rho}$ has a representation as a Cox
process [13] with its underlying random intensity measure being a Poisson-Gamma
random measure, see e.g. [10] for the latter process. More precisely, if
$D_{z,\rho}$ is the infinitely divisible random measure with its Lévy measure $\chi(\rho \otimes \tau_z)$
being the image of the product of $\rho$ and

$$\tau_z(dr) = \frac{1}{r} \, e^{-\frac{1-z}{z} r} \, dr, \qquad r > 0,$$

under the mapping $\chi : X \times \mathbb{R}_+ \to \mathcal{M}(X)$, $(x, r) \mapsto r\delta_x$, then the Cox
representation of the Polya sum process is

$$S_{z,\rho} = \int P_\kappa \, D_{z,\rho}(d\kappa). \tag{2.3}$$

The parameter $a$ in the introduction and the parameter $z$ are linked via the
relation $a = \frac{1-z}{z}$.
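
A one-dimensional consequence of the Cox representation can be sketched in Python. The sketch assumes that for a bounded set $B$ with $m = \rho(B)$ the mass $\kappa(B)$ under the Poisson-Gamma random measure is Gamma distributed with shape $m$ and rate $a = (1-z)/z$ (the marginal suggested by the Lévy measure above), so that the number of points in $B$ is a Gamma mixture of Poisson variables. By the classical Gamma-Poisson mixture identity this count is negative binomial (Polya) distributed. The function names and the values of $m$ and $z$ are illustrative assumptions.

```python
import math

def negative_binomial_pmf(k, m, z):
    """Polya weight Gamma(m+k) / (k! Gamma(m)) * (1-z)^m * z^k for m > 0, z in (0, 1)."""
    log_p = (math.lgamma(m + k) - math.lgamma(m) - math.lgamma(k + 1)
             + m * math.log(1.0 - z) + k * math.log(z))
    return math.exp(log_p)

def gamma_mixed_poisson_pmf(k, m, z, upper=80.0, steps=100_000):
    """P(N = k) for N | lam ~ Poisson(lam), lam ~ Gamma(shape m, rate (1-z)/z).

    Computed with a plain midpoint rule on (0, upper); a numerical check only.
    """
    a = (1.0 - z) / z
    log_const = m * math.log(a) - math.lgamma(m) - math.lgamma(k + 1)
    h = upper / steps
    total = 0.0
    for i in range(steps):
        lam = (i + 0.5) * h
        # Gamma density times Poisson weight, evaluated on the log scale
        total += math.exp(log_const + (m + k - 1.0) * math.log(lam) - (a + 1.0) * lam)
    return total * h

# Illustrative values: rho(B) = 2.5 and z = 0.4 (assumptions).
m, z = 2.5, 0.4
for k in range(4):
    print(k, gamma_mixed_poisson_pmf(k, m, z), negative_binomial_pmf(k, m, z))
```

The mixed Poisson weights and the Polya weights agree up to quadrature error, in line with the Polya urn interpretation of the counts.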

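We interpret $D_{z,\rho}$ as a prior for the measurable family of Poisson processes.
Following [5], let the Bayes kernel $B$ from $\mathcal{M}^{\cdot\cdot}(X)$ to $\mathcal{M}(X)$ be
the conditional law of the intensity $\kappa$ given the observation $\mu$ under the joint
law $P_\kappa(d\mu) \, D_{z,\rho}(d\kappa)$, or equivalently be defined via

$$\iint f(\mu, \kappa) \, P_\kappa(d\mu) \, D_{z,\rho}(d\kappa) = \iint f(\mu, \kappa) \, B(\mu, d\kappa) \, S_{z,\rho}(d\mu). \tag{2.4}$$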

Theorem 2.1. Let $z \in (0,1)$ and $\rho \in \mathcal{M}(X)$. Then the posterior measure
of the Polya sum process $S_{z,\rho}$ is a Poisson-Gamma random measure, more
precisely

$$B(\mu, \cdot\,) = D_{\frac{z}{1+z}, \, \rho + \mu}.$$

Note that if we write $z' = \frac{z}{1+z}$, then there corresponds some $a'$ to $z'$ and one
has $a' = \frac{1-z'}{z'} = \frac{1}{z} = a + 1$. Thus the transformation of the parameters is in the same spirit
as in the random variable case. Remark that $B(\mu, \cdot\,)$ is the superposition of
two random measures:

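Corollary 2.2. Under the assumptions of Theorem 2.1, we have

$$B(\mu, \cdot\,) = D_{\frac{z}{1+z}, \, \rho} * D_{\frac{z}{1+z}, \, \mu}.$$

Even more, since $\mu$ is a point measure, the second measure itself is a
superposition of random measures concentrated on single points.

Knowing $B(\mu, \cdot\,)$, the Bayes estimator of the (random) intensity measure for
the Poisson process given the observation $\mu \in \mathcal{M}^{\cdot\cdot}(X)$ is

$$b[\mu] = \int \kappa \, B(\mu, d\kappa);$$

we immediately identify the Bayes estimator as the intensity measure of $B(\mu, \cdot\,)$.

Corollary 2.3. Let $z \in (0, 1)$ and $\rho \in \mathcal{M}(X)$. Then the Bayes estimator given
an observation $\mu \in \mathcal{M}^{\cdot\cdot}(X)$ is

$$b[\mu] = z(\rho + \mu) \qquad S_{z,\rho}\text{-a.s.}$$

Hence the Bayes estimator is exactly the Polya sum kernel. The sum exactly
reflects the representation of $B(\mu, \cdot\,)$ as the convolution of two random
measures. In such a case the intensity measure necessarily is the sum of the two
intensity measures. Thus if one ignores (2.3) and denotes the point process on
its rhs. by $P$, then for any non-negative, measurable function $h$

$$\begin{aligned}
C_P(h) &= \iiint h(x, \mu) \, \mu(dx) \, P_\kappa(d\mu) \, D_{z,\rho}(d\kappa)
        = \iiint h(x, \mu + \delta_x) \, \kappa(dx) \, P_\kappa(d\mu) \, D_{z,\rho}(d\kappa) \\
       &= \iiint h(x, \mu + \delta_x) \, \kappa(dx) \, B(\mu, d\kappa) \, P(d\mu)
        = z \iint h(x, \mu + \delta_x) \, (\rho + \mu)(dx) \, P(d\mu)
\end{aligned}$$

by applying firstly Mecke's characterization of the Poisson process and then (2.4).
Since the last equation has the unique solution $S_{z,\rho}$, $P$ must be the Polya sum
process.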
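
On a finite space the Bayes estimator $z(\rho + \mu)$ can be written down directly. The following Python sketch assumes a finite set of sites, a measure $\rho$ given by positive site weights, and, as the finite-space form of the Poisson-Gamma prior, independent Gamma intensities with shape $\rho_x$ and rate $a = (1-z)/z$ at each site $x$. Under these assumptions the site-wise conjugate posterior mean $(\rho_x + \mu_x)/(a + 1)$ coincides with the Polya sum kernel. The function names and the example data are hypothetical.

```python
def polya_sum_kernel(z, rho, mu):
    """Bayes estimator z * (rho + mu), site by site.

    rho: dict site -> positive weight, mu: dict site -> observed multiplicity.
    """
    return {x: z * (rho[x] + mu.get(x, 0)) for x in rho}

def posterior_mean_by_conjugacy(z, rho, mu):
    """Site-wise posterior mean for Gamma(shape rho_x, rate a) priors and Poisson counts."""
    a = (1.0 - z) / z                      # link between a and z
    return {x: (rho[x] + mu.get(x, 0)) / (a + 1.0) for x in rho}

# Hypothetical example: three sites, z = 0.25.
rho = {"a": 0.5, "b": 2.0, "c": 1.0}
mu = {"a": 3, "c": 1}                      # observed configuration with a triple point at "a"
z = 0.25
estimate = polya_sum_kernel(z, rho, mu)
conjugate = posterior_mean_by_conjugacy(z, rho, mu)
print(estimate)
print(conjugate)
```

Both dictionaries contain the same values, since $a + 1 = 1/z$.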
Remark 2.4. 1. Instead of a fixed $z \in (0,1)$ one may start with a measurable
function $z : X \to (0,1)$. As long as $z$ is bounded away from 1, calculations
do not need further justifications.

2. If $X$ is countable, $\rho$ is the counting measure on $X$ and $z$, with the just
mentioned restrictions, is a probability measure on $X$, then we are in
the case of the Bose gas as in [2]. Given an observation of particles, the
Bayes estimator of the intensity of particles occurring in certain states is
the special Polya sum kernel.

3 Doubly stochastic Polya sum processes 

As indicated in the introduction, the construction of H-sufficient statistics in [3]
for a certain set of probability measures $\mathcal{C}$ fits into the context of Bayesian
statistics and has implications in connection with [13]. At first we briefly
describe the considered problem and the results obtained.

Denote by $\mathcal{C}$ the set of all point processes $P$ satisfying

$$P(\,\cdot \mid \mathcal{E}_B) = S_{z,\rho}(\,\cdot \mid \mathcal{E}_B) \qquad P\text{-a.s.} \tag{3.1}$$

for each bounded set $B \in \mathcal{B}_0$. $\mathcal{E}_B$ is a $\sigma$-algebra containing little information
about the events inside the bounded set $B$ and full information about the
events outside $B$, such that if $B'$ contains $B$, then $\mathcal{E}_{B'}$ is contained in $\mathcal{E}_B$. We
will be more precise in a moment and only remark that $\rho$ is assumed to be
a diffuse and infinite measure. Such a $P$ is called a stochastic field with local
characteristic given by the rhs. of (3.1).

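In [13] a stochastic kernel $Q$ from $\mathcal{M}^{\cdot\cdot}(X)$ to $\mathcal{M}^{\cdot\cdot}(X)$ was constructed
satisfying

$$P = \int Q(\mu, \cdot\,) \, P(d\mu) \tag{3.2}$$

for any $P \in \mathcal{C}$ and moreover such that

$$Q\bigl(\mu, \{\mu' : Q(\mu', \cdot\,) = Q(\mu, \cdot\,)\}\bigr) = 1$$

holds. In fact, $Q$ is the common conditional probability of $P$ conditioned on
the asymptotic $\sigma$-algebra $\mathcal{E}_\infty = \bigcap_{B \in \mathcal{B}_0} \mathcal{E}_B$ for every $P \in \mathcal{C}$. In the terminology
of [3], $Q$ is an H-sufficient statistic for $\mathcal{C}$; in the terminology of [5], $Q$ is a
decomposing kernel and $\mathcal{C}$ an ergodically decomposable simplex.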

From [3] it follows that there exists a subset $\Delta \subset \mathcal{C}$ of extreme points and a
unique probability measure $V_P$ on $\Delta$ such that

$$P = \int_\Delta P' \, V_P(dP'). \tag{3.3}$$

Comparing equations (3.3) and (3.2), $Q(\mu, \cdot\,)$ is the posterior of $V_P$ given the
observation $\mu$.

So far this is the essence of [3] and [5]; the choices of $(\mathcal{E}_B)_{B \in \mathcal{B}_0}$ discussed
in [13] lead to the extremal points:

1. if $\mathcal{E}_B$ admits counting the points inside $B$ with multiplicity, then

$$\Delta = \{S_{z,\rho} : z \in [0,1)\},$$

2. if $\mathcal{E}_B$ admits counting the points inside $B$ without multiplicity, then

$$\Delta = \{S_{z,w\rho} : w \in [0,+\infty)\},$$

3. if $\mathcal{E}_B$ admits counting the points inside $B$ with and without multiplicity,
then

$$\Delta = \{S_{z,w\rho} : z \in (0,1), \, w \in (0,+\infty) \text{ or } z = w = 0\}.$$

For any of these cases, $Q$ is identified as

$$Q(\mu, \cdot\,) = S_{Z(\mu), \, W(\mu)\rho},$$

where $Z$ and $W$ are uniquely determined from the densities of points with and
without multiplicity (in the first two cases one of them is fixed). Note that
since we assumed $\rho$ to be an infinite measure, these densities are almost surely
constant for each extremal point. We remain in the setup of the last case, i.e.
start with any a priori distribution $V$ on $(0, 1) \times \mathcal{M}(X)$ and consider the doubly
stochastic Polya sum process directed by $V$.

Remark 3.1. For any a priori distribution $V$ on $(0,1) \times \mathcal{M}(X)$, the doubly
stochastic Polya sum process $S_V = \int S_{z,\rho} \, V(dz, d\rho)$ is a Cox process $P_V$ directed
by the doubly stochastic Poisson-Gamma random measure $D_V = \int D_{z,\rho} \, V(dz, d\rho)$.

Theorem 3.2. Let $V$ be an a priori distribution on $(0,1) \times \mathcal{M}(X)$. If $V$ is
concentrated on $(0, 1) \times \{w\rho_0 : w \in \mathbb{R}_+\} \cup \{(0, 0)\}$ for some infinite and diffuse
measure $\rho_0 \in \mathcal{M}(X)$, then

1. $V$ is ergodically decomposable for the family $\{S_{z,w\rho_0} : z \in (0,1), \, w \in
(0, +\infty) \text{ or } z = w = 0\}$ and the Bayes kernel for the parameters is

$$V(\mu, \cdot\,) = \delta_{(Z(\mu), \, W(\mu)\rho_0)},$$

where $(Z(\mu), W(\mu))$ is the unique solution $(z, w)$ of the equations

$$w \, \frac{z}{1-z} = U(\mu), \qquad -w \log(1 - z) = V(\mu),$$

and $U(\mu)$ and $V(\mu)$ are the perfect estimates of the density of points with
and without multiplicity, respectively. $Z$ and $W$ are $\mathcal{E}_\infty$-measurable.

2. The Bayes kernel $B_V$ of the Cox process $P_V$ is given by

$$B_V(\mu, \cdot\,) = D_{\frac{Z(\mu)}{1+Z(\mu)}, \, W(\mu)\rho_0 + \mu},$$

hence the Bayes estimator for the random intensity measure is

$$b_V[\mu] = Z(\mu)\bigl(W(\mu)\rho_0 + \mu\bigr).$$
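
The pair of equations in the first part of Theorem 3.2 can be solved numerically once the two densities are known. The following Python sketch assumes that the perfect estimates are given as two numbers `u` and `v` with `u > v > 0` (how they are obtained from an infinite configuration is not part of the sketch). Dividing the equations eliminates $w$, and the quotient $z / ((1-z)(-\log(1-z)))$ increases from 1 to $+\infty$ on $(0,1)$, so a bisection suffices. The function names and the values used are illustrative assumptions.

```python
import math

def solve_z_w(u, v, tol=1e-12):
    """Solve w*z/(1-z) = u and -w*log(1-z) = v for (z, w), assuming u > v > 0."""
    if not (u > v > 0.0):
        raise ValueError("need u > v > 0 for a solution with z in (0, 1)")
    target = u / v

    def ratio(z):
        # u/v as a function of z alone; increasing from 1 (z -> 0) to +inf (z -> 1)
        return z / ((1.0 - z) * (-math.log1p(-z)))

    lo, hi = 1e-12, 1.0 - 1e-12
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if ratio(mid) < target:
            lo = mid
        else:
            hi = mid
    z = 0.5 * (lo + hi)
    w = v / (-math.log1p(-z))
    return z, w

def bayes_estimator_mass(z, w, rho0_mass, mu_mass):
    """Mass Z * (W * rho_0(B) + mu(B)) that the estimator assigns to a bounded set B."""
    return z * (w * rho0_mass + mu_mass)

# Round trip with illustrative parameters z0 = 0.4, w0 = 2.5 (assumptions).
z0, w0 = 0.4, 2.5
u0 = w0 * z0 / (1.0 - z0)          # density of points with multiplicity
v0 = -w0 * math.log(1.0 - z0)      # density of points without multiplicity
z_hat, w_hat = solve_z_w(u0, v0)
print(z_hat, w_hat, bayes_estimator_mass(z_hat, w_hat, rho0_mass=1.0, mu_mass=3))
```

Bisection is chosen here only for robustness; any one-dimensional root finder would do.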



Thus under the assumptions of Theorem 3.2 we have shown that the Cox
process $P_V$ is a Papangelou process:

Corollary 3.3. Let $V$ be a distribution on $(0, 1) \times \{w\rho_0 : w \in \mathbb{R}_+\} \cup \{(0,0)\}$
for some infinite and diffuse measure $\rho_0 \in \mathcal{M}(X)$. Then $S_V$ is a solution of
the partial integration formula

$$C_P(h) = \iint h(x, \mu + \delta_x) \, Z(\mu)\bigl(W(\mu)\rho_0 + \mu\bigr)(dx) \, P(d\mu) \tag{3.4}$$

and $Z$ and $W$ are $\mathcal{E}_\infty$-measurable random variables. Moreover, any solution
of (3.4) with $\mathcal{E}_\infty$-measurable $Z$ and $W$ and infinite and diffuse measure $\rho_0 \in
\mathcal{M}(X)$ is a mixed Polya sum process.

Remark 3.4. 1. Although a priori $B_V$ and $b_V$ depend on the probability
measure $V$, due to the ergodic decomposability of $V$ they do not. Hence
any choice of $V$ yields a Cox process $P_V$ (a mixed Polya sum process
$S_V$) which solves equation (3.4) with a single Papangelou kernel. Hence
we derived and discussed an integration-by-parts formula which does not
have a unique solution.

2. The fact that in Corollary 3.3 $S_V$ solves equation (3.4) is a straightforward
application of the integration-by-parts formula for the Polya sum process
together with the ergodic decomposability of $V$. The latter property
allows one to estimate the parameters $z$ and $w$ from an infinite configuration
of points. This does not depend on the choice of mixed Polya sum processes:
In fact, if the Polya sum processes are replaced by Papangelou processes,
and if they are mixed with respect to an ergodically decomposable mixing
measure, then an equation analogous to (3.4) holds for the mixture and
the mixed Papangelou process is a Papangelou process itself.

4 Proofs

We identify $B(\mu, \cdot\,)$ by computing the Laplace transforms of both sides of
equation (2.4). This shows Theorem 2.1. Before that, recall, e.g. from [10], that the
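Laplace transform of the Poisson-Gamma random measure with Lévy measure
$\chi(\rho \otimes \tau_z)$ is

$$L_{D_{z,\rho}}(h) = \exp\Bigl( -\iint \bigl[1 - e^{-r h(x)}\bigr] \, \tau_z(dr) \, \rho(dx) \Bigr), \qquad h \in F(X),$$

which by the relation

$$\int_0^\infty \bigl[1 - e^{-ry}\bigr] \, \tau_z(dr) = \int_0^\infty \bigl[1 - e^{-ry}\bigr] \frac{1}{r} \, e^{-ar} \, dr = \log\Bigl[1 + \frac{y}{a}\Bigr], \qquad a = \frac{1-z}{z},$$

turns into

$$L_{D_{z,\rho}}(h) = \exp\Bigl( -\int_X \log\Bigl[1 + \frac{z h(x)}{1-z}\Bigr] \rho(dx) \Bigr), \qquad h \in F(X). \tag{4.1}$$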
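
The integral identity behind (4.1) can be checked numerically. The following Python sketch assumes the Lévy density $\frac{1}{r} e^{-ar}$ with $a = (1-z)/z$ and approximates the integral by a midpoint rule on a truncated interval; the truncation point, the step count and the function names are illustrative choices, not part of the paper.

```python
import math

def levy_integral(y, z, upper=60.0, steps=200_000):
    """Midpoint-rule value of int_0^upper (1 - exp(-r*y)) * exp(-a*r) / r dr, a = (1-z)/z."""
    a = (1.0 - z) / z
    h = upper / steps
    total = 0.0
    for i in range(steps):
        r = (i + 0.5) * h
        # -expm1(-r*y) equals 1 - exp(-r*y) and stays accurate for small r
        total += -math.expm1(-r * y) * math.exp(-a * r) / r
    return total * h

def closed_form(y, z):
    """log(1 + z*y/(1-z)), the integrand appearing in (4.1)."""
    return math.log1p(z * y / (1.0 - z))

for y, z in [(2.0, 0.4), (0.5, 0.7)]:
    print(y, z, levy_integral(y, z), closed_form(y, z))
```

For values of $z$ close to 1 the rate $a$ becomes small and a larger truncation point would be needed.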



Secondly note that the Laplace transform of the Polya sum process is

$$L_{S_{z,\rho}}(g) = \exp\Bigl( -\int_X \log\Bigl[1 + z \, \frac{1 - e^{-g(x)}}{1-z}\Bigr] \rho(dx) \Bigr), \qquad g \in F(X). \tag{4.2}$$



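Lemma 4.1. Let $z \in (0, 1)$ and $\rho \in \mathcal{M}(X)$. Then for all $g, h \in F(X)$,

$$\iint e^{-\zeta_{g \otimes h}(\mu, \kappa)} \, P_\kappa(d\mu) \, D_{z,\rho}(d\kappa) = \exp\Bigl( -\int \log\Bigl[1 + z \, \frac{1 - e^{-g} + h}{1-z}\Bigr] d\rho \Bigr),$$

where $\zeta_{g \otimes h}(\mu, \kappa) = \zeta_g(\mu) + \zeta_h(\kappa)$.

Proof. Note that the inner integral on the lhs. is the Laplace transform of the
Poisson process with intensity measure $\kappa$, hence

$$\int e^{-\zeta_g} \, dP_\kappa = \exp\Bigl( -\int \bigl[1 - e^{-g}\bigr] d\kappa \Bigr).$$

Thus what remains is the Laplace transform of $D_{z,\rho}$ at $1 - e^{-g} + h$. But this
is by equation (4.1)

$$\exp\Bigl( -\int \log\Bigl[1 + z \, \frac{1 - e^{-g(x)} + h(x)}{1-z}\Bigr] \rho(dx) \Bigr).$$

□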



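Next compute the rhs. of equation (2.4).

Lemma 4.2. Let $z \in (0, 1)$ and $\rho \in \mathcal{M}(X)$. Then, with $z' = \frac{z}{1+z}$, for all
$g, h \in F(X)$

$$\iint e^{-\zeta_{g \otimes h}(\mu, \kappa)} \, B(\mu, d\kappa) \, S_{z,\rho}(d\mu) = \exp\Bigl( -\int \log\Bigl[1 + z \, \frac{1 - e^{-g} + h}{1-z}\Bigr] d\rho \Bigr).$$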
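Proof. We check the Ansatz $B(\mu, \cdot\,) = D_{z', \rho + \mu}$ with $z' = \frac{z}{1+z}$. By equation (4.1),

$$\int e^{-\zeta_h} \, dD_{z', \rho+\mu} = \exp\Bigl( -\int \log\Bigl[1 + \frac{z' h(x)}{1 - z'}\Bigr] (\rho + \mu)(dx) \Bigr) = \exp\Bigl( -\int \log\bigl[1 + z h(x)\bigr] (\rho + \mu)(dx) \Bigr).$$

Therefore we get two exponentials, where for the integration with respect to
$S_{z,\rho}$ only the integral with respect to $\mu$ matters. But this just is the Laplace
transform of $S_{z,\rho}$ evaluated at $g + \log[1 + zh]$, hence

$$\begin{aligned}
\iint e^{-\zeta_{g \otimes h}(\mu, \kappa)} \, B(\mu, d\kappa) \, S_{z,\rho}(d\mu)
 &= \exp\Bigl( -\int \log[1 + zh] \, d\rho \Bigr) \exp\Bigl( -\int \log\Bigl[1 + z \, \frac{1 - e^{-g - \log[1 + zh]}}{1-z}\Bigr] d\rho \Bigr) \\
 &= \exp\Bigl( -\int \log\Bigl[1 + z \, \frac{1 - e^{-g} + h}{1-z}\Bigr] d\rho \Bigr).
\end{aligned}$$

□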
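
The last equality in the proof rests on a pointwise algebraic identity between the two integrands, together with the relation $z'/(1-z') = z$ for $z' = z/(1+z)$. A small Python sketch can confirm both on a grid of values; the function names and the grid are illustrative assumptions.

```python
import math

def lemma_4_1_integrand(g, h, z):
    """log(1 + z*(1 - exp(-g) + h)/(1-z)), the integrand on the rhs. of Lemma 4.1."""
    return math.log(1.0 + z * (1.0 - math.exp(-g) + h) / (1.0 - z))

def lemma_4_2_integrand(g, h, z):
    """Sum of the two logarithms obtained in the proof of Lemma 4.2."""
    shifted = g + math.log(1.0 + z * h)          # argument g + log(1 + z*h) of (4.2)
    return (math.log(1.0 + z * h)
            + math.log(1.0 + z * (1.0 - math.exp(-shifted)) / (1.0 - z)))

def z_prime(z):
    """Posterior parameter z' = z / (1 + z)."""
    return z / (1.0 + z)

max_gap = 0.0
for z in (0.1, 0.4, 0.9):
    for g in (0.0, 0.3, 2.0):
        for h in (0.0, 0.7, 5.0):
            gap = abs(lemma_4_1_integrand(g, h, z) - lemma_4_2_integrand(g, h, z))
            max_gap = max(max_gap, gap)
print(max_gap)
```

The maximal gap is of the order of floating-point rounding, as expected from the identity $(1 + zh)\bigl(1 + z\frac{1 - e^{-g}/(1+zh)}{1-z}\bigr) = 1 + z\frac{1 - e^{-g} + h}{1-z}$.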



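Finally to get Corollary 2.3, note that for any $h \in F_b(X)$, the intensity
of $B(\mu, \cdot\,) = D_{z', \rho+\mu}$, $z' = \frac{z}{1+z}$, can be computed from the Campbell measure

$$C_{D_{z', \rho+\mu}}(h \otimes 1) = \iiint h(x) \, e^{-r/z} \, dr \, (\rho + \mu)(dx) \, D_{z', \rho+\mu}(d\kappa) = z \int h(x) \, (\rho + \mu)(dx) = b[\mu](h).$$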

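The first statement of Theorem 3.2 is a reformulation of the Theorem in [14].
The second part follows from the observation that for a non-negative,
measurable function $g$, by the application of Theorem 2.1 and the ergodic
decomposability,

$$\begin{aligned}
&\iiint g(\mu, \kappa, z, w\rho_0) \, P_\kappa(d\mu) \, D_{z, w\rho_0}(d\kappa) \, V(dz, dw) \\
&\quad = \iiint g(\mu, \kappa, z, w\rho_0) \, D_{z', \, w\rho_0 + \mu}(d\kappa) \, S_{z, w\rho_0}(d\mu) \, V(dz, dw) \\
&\quad = \iiint g(\mu, \kappa, z, w\rho_0) \, D_{Z'(\mu), \, W(\mu)\rho_0 + \mu}(d\kappa) \, S_{z, w\rho_0}(d\mu) \, V(dz, dw),
\end{aligned}$$

where $z' = \frac{z}{1+z}$ and $Z'(\mu) = \frac{Z(\mu)}{1+Z(\mu)}$.
Dropping the dependence of $g$ on the last two arguments, we get the second
part of Theorem 3.2.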
Proof of Corollary 3.3. Any mixed Polya sum process $S_V$ solves the partial
integration formula (3.4). Now assume that $P$ is any solution of (3.4), then the
joint Laplace transform of $Z$, $W$ and $P$ is

$$L_{Z,W,P}(u, v, tf) = P\bigl(e^{-uZ - vW - t\zeta_f}\bigr) = P\bigl(e^{-uZ - vW} \, P(e^{-t\zeta_f} \mid \mathcal{E}_\infty)\bigr)$$

by conditioning on $\mathcal{E}_\infty$. Denote by $P_\infty$ the conditioned point process $P$.
Differentiation with respect to $t$ yields the Campbell measure of $P_\infty$, which allows one to identify
this conditional measure. Thus on the one hand

$$-\frac{d}{dt} L_{Z,W,P}(u, v, tf) = P\bigl(e^{-uZ - vW} \, C_{P_\infty}(f \otimes e^{-t\zeta_f})\bigr).$$

On the other hand,

$$\begin{aligned}
-\frac{d}{dt} L_{Z,W,P}(u, v, tf) &= C_P\bigl(f \otimes e^{-uZ - vW - t\zeta_f}\bigr) \\
 &= P\Bigl(e^{-uZ - vW} \iint f(x) \, e^{-t\zeta_f(\mu) - t f(x)} \, Z(W\rho_0 + \mu)(dx) \, P_\infty(d\mu)\Bigr).
\end{aligned}$$

Thus $P_\infty$ satisfies $P$-a.s. the functional equation (2.2) and therefore is a Polya
sum process with the parameters given by $Z$ and $W\rho_0$. But then immediately $P$
is a mixture of these processes. □